An extremal problem in uniform distribution theory


Dedicated to Harald Niederreiter on the occasion of his 70th birthday 


V. Baláž, M.R. Iacò, O. Strauch, S. Thonhauser, R.F. Tichy *


Abstract 


In this paper we consider an optimization problem for Cesàro means
of bivariate functions. We apply methods from uniform distribution 
theory, calculus of variations and ideas from the theory of optimal 
transport. 

1 Introduction 

In a series of papers J.G. van der Corput [MUST] systematically investigated 
distribution functions of sequences of real numbers. More recently, the study 
of distribution functions was extended to multivariate functions by the Slovak 
school of O. Strauch and his coworkers; see m [21 El [23]. These investigations 
include the study of the set of all distribution functions of a given sequence 
and various optimization problems. 

A particularly interesting problem is the study of extremal limits of two-dimensional sequences of the form

$$\frac{1}{N}\sum_{n=1}^{N} F(x_n, y_n), \qquad (1)$$

where $(x_n)_{n\in\mathbb{N}}$, $(y_n)_{n\in\mathbb{N}}$ are uniformly distributed (for short u.d.) sequences in the unit interval and $F$ is a given continuous function on $[0,1]^2$; see [15].

Let us recall that a sequence $(x_n)_{n\in\mathbb{N}}$ of points in $[0,1[$ is said to be u.d. if and only if

$$\lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N}\mathbf{1}_{[a,b[}(x_n) = b-a$$

for all intervals $[a,b[\,\subseteq[0,1[$, where $\mathbf{1}_E$ denotes as usual the indicator function of the set $E$. We refer to [5], [12], [22] as general references on the subject.
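The definition can be checked empirically. The following is a minimal sketch, assuming the base-2 van der Corput sequence as the test sequence (our choice for illustration; the interval endpoints and the sample size are hypothetical as well). It compares the relative frequency of points in $[a,b[$ with the interval length $b-a$.

```python
def van_der_corput(n, base=2):
    """Return the n-th term (n >= 1) of the van der Corput sequence in the given base."""
    value, denom = 0.0, 1.0
    while n > 0:
        n, digit = divmod(n, base)
        denom *= base
        value += digit / denom
    return value


def empirical_frequency(points, a, b):
    """Fraction of the points that fall into the half-open interval [a, b)."""
    return sum(1 for x in points if a <= x < b) / len(points)


N = 10_000                      # hypothetical sample size
points = [van_der_corput(n) for n in range(1, N + 1)]
a, b = 0.2, 0.55                # hypothetical test interval
freq = empirical_frequency(points, a, b)
print(f"empirical frequency {freq:.4f} vs. interval length {b - a:.4f}")
```

For a u.d. sequence the two printed numbers approach each other as $N$ grows; a finite sample only illustrates the limit and does not prove it.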

A mapping $T$ of the unit interval into itself is called uniform distribution preserving (for short u.d.p.) if the sequence $(T(x_n))_{n\in\mathbb{N}}$ is u.d. whenever $(x_n)_{n\in\mathbb{N}}$ is a u.d. sequence in $[0,1[$.

These maps have been extensively studied (see e.g. mm), also in connection 
to variational problems [21] and extended to compact metric spaces [24].
They are particularly interesting for the purposes of this paper since they 
can be thought of as suitable measure preserving rearrangements of the unit 
interval, as we will see in the next section. 

It turned out that the study of the asymptotic behaviour of the mean values (1) is equivalent to finding optimal bounds for Riemann-Stieltjes integrals of the form

$$\int_0^1\!\!\int_0^1 F(x,y)\,dC(x,y), \qquad (2)$$

where $C$ is the asymptotic distribution function of the sequence $(x_n,y_n)_{n\in\mathbb{N}}$ and is usually referred to as copula (see El). More precisely, a 2-copula is a function $C\colon[0,1]^2\to[0,1]$ satisfying the following properties: for every $x,y\in[0,1]$

$$C(x,0) = C(0,y) = 0,$$

$$C(x,1) = x \quad\text{and}\quad C(1,y) = y,$$

and for every $x_1,x_2,y_1,y_2\in[0,1]$ with $x_2\ge x_1$ and $y_2\ge y_1$

$$C(x_2,y_2) - C(x_2,y_1) - C(x_1,y_2) + C(x_1,y_1) \ge 0.$$

An important property of copulas, which makes the expression in (2) meaningful and which can be derived from the above properties, is that every copula $C$ induces a doubly stochastic measure $\gamma_C$ (later on denoted by $\gamma$ when there is no possibility of confusion) on the measurable space $([0,1]^2,\mathcal{B})$ via the formula

$$\gamma([a,b]\times[c,d]) = C(b,d) - C(b,c) - C(a,d) + C(a,c).$$
Moreover, there is a one-to-one correspondence between copulas and doubly stochastic measures. For every copula $C$, the measure $\gamma$ is doubly stochastic in the sense that for every Borel set $B\subseteq[0,1]$, $\gamma([0,1]\times B) = \gamma(B\times[0,1]) = \lambda(B)$, where $\lambda$ is the Lebesgue measure on $[0,1]$. Conversely, for every doubly stochastic measure $\mu$, there exists a copula $C$ given by $C(u,v) = \mu([0,u]\times[0,v])$. Clearly, a probability measure on $([0,1]^2,\mathcal{B})$ with uniform marginals is doubly stochastic. We refer to Elm EH for details.
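The rectangle formula is easy to evaluate directly. The sketch below is illustrative only: the function names and the test rectangle are our assumptions. It computes the mass that three standard copulas (the product copula and the two Fréchet-Hoeffding bounds) assign to a rectangle, and checks the uniform-marginal property on a strip.

```python
def rectangle_measure(C, a, b, c, d):
    """Mass of [a, b] x [c, d] under the measure induced by the copula C."""
    return C(b, d) - C(b, c) - C(a, d) + C(a, c)


def product_copula(u, v):
    """Copula of independent uniform marginals."""
    return u * v


def upper_bound_copula(u, v):
    """Upper Frechet-Hoeffding bound min(u, v)."""
    return min(u, v)


def lower_bound_copula(u, v):
    """Lower Frechet-Hoeffding bound max(u + v - 1, 0)."""
    return max(u + v - 1.0, 0.0)


rect = (0.2, 0.5, 0.1, 0.4)     # hypothetical rectangle [0.2, 0.5] x [0.1, 0.4]
mass_product = rectangle_measure(product_copula, *rect)
mass_upper = rectangle_measure(upper_bound_copula, *rect)
mass_lower = rectangle_measure(lower_bound_copula, *rect)

# Doubly stochastic property: the strip [0, 1] x [c, d] has mass d - c.
strip_mass = rectangle_measure(upper_bound_copula, 0.0, 1.0, 0.1, 0.4)
print(mass_product, mass_upper, mass_lower, strip_mass)
```

The upper bound concentrates its mass on the diagonal and the lower bound on the anti-diagonal, which is why the same rectangle receives different mass under each copula.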

With a slight abuse of notation we write $\int_0^1\!\int_0^1 F(x,y)\,dC(x,y)$ for the integral with respect to the measure $\gamma$.

It should be remarked that copulas are very popular in applied probability. In particular, they are used in financial mathematics for modeling dependency structures among different kinds of risks; see EESEZIES].

Notice, however, that in the calculation of these integrals one can also take advantage of the probabilistic interpretation of a copula, as the joint distribution function of a random vector with uniform marginals. More precisely, consider a random vector $(X_1,X_2)$ and suppose that its marginals $F_1$ and $F_2$ are continuous. By applying the probability integral transformation to each component, the random vector

$$(U_1,U_2) = (F_1(X_1),F_2(X_2))$$

has uniform marginals. The copula of $(X_1,X_2)$ is then defined as the joint cumulative distribution function (cdf) of $(U_1,U_2)$:

$$C(u_1,u_2) = P(U_1\le u_1, U_2\le u_2).$$

In this setting, the notion of optimal coupling can be stated. Let $\mu$, $\nu$ be two probability measures on $(M,\mathcal{F})$. Coupling $\mu$ and $\nu$ means constructing two random variables $X_1$, $X_2$ on $M$ in such a way that $X_1\sim\mu$ and $X_2\sim\nu$, with $\sim$ denoting equality in distribution. The couple $(X_1,X_2)$ is called a coupling of $(\mu,\nu)$. Now, if we introduce a cost function $c(x,y)$ on $M\times M$, then the problem of finding

$$\inf \mathbb{E}(c(X_1,X_2)),$$

where the pair $(X_1,X_2)$ runs over all possible couplings of $(\mu,\nu)$, is called c-optimal coupling or Monge-Kantorovich mass transportation problem (see e.g. [18], [28]). Equivalently, this problem can be stated in terms of measures,

$$\inf \int c(x,y)\,dP(x,y),$$

where the infimum runs over all joint probability measures $P$ on $M\times M$ with marginals $\mu$ and $\nu$. For connections of extremal limits and copulas to the theory of optimal transport we refer to [10].

Throughout this paper we consider the equivalent sup problem

$$\sup\{\mathbb{E}(c(X_1,X_2)) \mid X_1, X_2 \text{ couplings of } \mu,\nu \text{ with } P_{X_1} = \mu \text{ and } P_{X_2} = \nu\}. \qquad (3)$$
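A discrete analogue makes the sup problem concrete. The sketch below rests on an assumption that is not part of the text: both marginals are replaced by $N$ equally weighted atoms, and only couplings given by permutations are considered. A brute-force search then finds the best pairing. For the hypothetical cost $c(x,y)=xy$ the rearrangement inequality predicts that the identity pairing wins.

```python
from itertools import permutations


def best_permutation(cost, xs, ys):
    """Brute-force search for the permutation sigma maximizing the mean of cost(x_i, y_sigma(i))."""
    best_value, best_sigma = float("-inf"), None
    for sigma in permutations(range(len(ys))):
        value = sum(cost(x, ys[j]) for x, j in zip(xs, sigma)) / len(xs)
        if value > best_value:
            best_value, best_sigma = value, sigma
    return best_value, best_sigma


N = 6                                           # small, since N! pairings are enumerated
grid = [(i + 0.5) / N for i in range(N)]        # atoms approximating the uniform law
value, sigma = best_permutation(lambda x, y: x * y, grid, grid)
print(value, sigma)
```

Enumeration is only feasible for very small $N$; it is meant as an illustration of the problem, not as a practical solver.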

Furthermore, we focus our attention on couplings between uniform distributions. In Section 4 we use this approach for solving a specific instance of maximizing the integral (2).

A useful criterion for checking the optimality of a candidate solution is based on the notion of c-convexity. A function $f\colon X\to\mathbb{R}$ is called c-convex if it has a representation $f(x) = \sup_y\{c(x,y) + a(y)\}$, for some function $a$. The associated c-subdifferential of $f$ at $x$ is then defined through:

$$\partial_c f(x) = \{y \mid f(z) - f(x) \ge c(z,y) - c(x,y)\ \ \forall z\in X\}$$

and $\partial_c f = \{(x,y)\in X\times Y \mid y\in\partial_c f(x)\}$.

Notice, see [18], that $y\in\partial_c f(x)$ if and only if $\exists\, a\,(= a(y))\in\mathbb{R}$ such that

$$\psi_{y,a}(x) = c(x,y) + a(y) = f(x) \quad\text{and}\quad \psi_{y,a}(\xi) = c(\xi,y) + a(y) \le f(\xi), \qquad (4)$$

for every $\xi\in X$.

The dual problem of (3) is given by:

$$I(c) = \inf\left\{\int h_1\,d\mu + \int h_2\,d\nu \;\Big|\; c\le h_1\oplus h_2,\ h_1\in L^1(\mu) \text{ and } h_2\in L^1(\nu)\right\},$$

and its study is the basis of the following theorem.

Theorem 1 (Th. 4.7 from [18]). Let $c$ be such that $c(x,y)\ge a(x) + b(y)$ for some $a\in L^1(\mu)$, $b\in L^1(\nu)$ and assume finiteness of $I(c)$. Then a pair $(X_1,X_2)$ with $X_1\sim\mu$, $X_2\sim\nu$ is an optimal c-coupling between $\mu$ and $\nu$ if and only if

$$(X_1,X_2)\in\partial_c f \quad\text{a.s.}$$

for some c-convex function $f$, equivalently, $X_2\in\partial_c f(X_1)$ a.s.
2 Main results


As already pointed out in [7], the solution of problem (2) depends on the sign of the partial derivative $D_2 = \frac{\partial^2 F}{\partial x\,\partial y}$. Special cases have been already studied in the literature, like those described in Fig. 1 and Fig. 2 (see m). In particular, the upper and lower bounds for the first case are given precisely by the Fréchet-Hoeffding bounds, while in the second case the authors provide a criterion [7, Theorem 7] to find the corresponding extrema.

Figure 1 and Figure 2. Unit squares with corners $(0,0)$, $(1,0)$, $(0,1)$, $(1,1)$; labels recovered from the figures: $D_2 < 0$ and $D_2 > 0$.

In this paper we maximize (2) in the special situation described in Fig. 3 as a problem of optimal coupling (see [25]) and we provide a criterion for the instance of Fig. 4.


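Figure 3 and Figure 4. Unit squares with corners $(0,0)$, $(1,0)$, $(0,1)$, $(1,1)$, partitioned into regions on which $D_2$ has constant sign; labels recovered from the figures: $D_2 > 0$, $D_2 > 0$, $D_2 < 0$.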


We start by determining the copula which maximizes (2) when the sign of the second derivative changes as described in Fig. 4. We apply the following criterion [7, Theorem 7].


Theorem 2. Let us assume that a copula $C(x,y)$ maximizes the integral $\int_0^1\!\int_0^1 F(x,y)\,dC(x,y)$. Let $[X_1,X_2]\times[Y_1,Y_2]$ be an interval in $[0,1]^2$ such that

$$C(X_2,Y_2) + C(X_1,Y_1) - C(X_1,Y_2) - C(X_2,Y_1) > 0 \qquad (5)$$

and such that for every interior point $(x,y)$ the mixed second derivative $D_2$ has constant sign. Then we have:

(i) if $D_2 > 0$, then

$$C(x,y) = \min\big(C(x,Y_2) + C(X_1,y) - C(X_1,Y_2),\ C(x,Y_1) + C(X_2,y) - C(X_2,Y_1)\big), \qquad (6)$$

(ii) if $D_2 < 0$, then

$$C(x,y) = \max\big(C(x,Y_2) + C(X_2,y) - C(X_2,Y_2),\ C(x,Y_1) + C(X_1,y) - C(X_1,Y_1)\big), \qquad (7)$$

for every $(x,y)\in[X_1,X_2]\times[Y_1,Y_2]$.

This result can be illustrated by the following pictures, where the symbols $\oplus$ and $\ominus$ in a corner mean that the value of $C$ in that point is taken with positive and negative sign, respectively.


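Figure 5. The rectangle $[X_1,X_2]\times[Y_1,Y_2]$ with the points $(X_1,Y_2)$, $(x,Y_2)$, $(X_2,Y_2)$, $(X_1,y)$, $(x,y)$, $(X_2,y)$, $(X_1,Y_1)$, $(x,Y_1)$, $(X_2,Y_1)$ and the signs $\oplus$, $\ominus$ at the corners.

Figure 6. The same rectangle and points, with the signs $\oplus$, $\ominus$ at the corners.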


In order to apply this criterion to the case described in Fig. 4 we divide the unit square $[0,1]^2$ into $[0,x_1]\times[0,1]$, $[x_1,x_2]\times[0,1]$ and $[x_2,1]\times[0,1]$, as can be seen in Fig. 7.

Then, following the above statement, if $x\in(0,x_1)\cup(x_2,1)$, we apply (6) in the same way as in Fig. 5 and if $x\in(x_1,x_2)$, we apply (7) as in Fig. 6.


Figure 7. The unit square divided into three vertical strips with $D_2 > 0$, $D_2 < 0$ and $D_2 > 0$.
Consequently, the following Theorem holds true. 

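Theorem 3. Let $0 < x_1 < x_2 < 1$ and

$$F(x,y) = \begin{cases} F_1(x,y) & \text{if } x\in(0,x_1),\ \frac{\partial^2 F_1(x,y)}{\partial x\,\partial y} > 0,\\[2pt] F_2(x,y) & \text{if } x\in(x_1,x_2),\ \frac{\partial^2 F_2(x,y)}{\partial x\,\partial y} < 0,\\[2pt] F_3(x,y) & \text{if } x\in(x_2,1),\ \frac{\partial^2 F_3(x,y)}{\partial x\,\partial y} > 0. \end{cases} \qquad (8)$$

Then the copula maximizing $\int_0^1\!\int_0^1 F(x,y)\,dC(x,y)$ has the form

$$C(x,y) = \begin{cases} \min(x,h_1(y)) & \text{if } x\in[0,x_1],\\ \max(x + h_2(y) - x_2,\ h_1(y)) & \text{if } x\in[x_1,x_2],\\ \min(x - x_2 + h_2(y),\ y) & \text{if } x\in[x_2,1], \end{cases} \qquad (9)$$

where $h_1(y) = C(x_1,y)$, and $h_2(y) = C(x_2,y)$.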
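For orientation, the three branches of (9) can be read off by inserting each vertical strip into Theorem 2 with $Y_1 = 0$, $Y_2 = 1$ and using the boundary conditions of a copula. The following worked computation is a sketch of that substitution; it assumes that condition (5) holds on each strip.

```latex
% Strip [0,x_1] x [0,1], D_2 > 0: apply (6) with X_1 = 0, X_2 = x_1.
C(x,y) = \min\bigl(C(x,1) + C(0,y) - C(0,1),\; C(x,0) + C(x_1,y) - C(x_1,0)\bigr)
       = \min\bigl(x,\; h_1(y)\bigr).

% Strip [x_1,x_2] x [0,1], D_2 < 0: apply (7) with X_1 = x_1, X_2 = x_2.
C(x,y) = \max\bigl(C(x,1) + C(x_2,y) - C(x_2,1),\; C(x,0) + C(x_1,y) - C(x_1,0)\bigr)
       = \max\bigl(x + h_2(y) - x_2,\; h_1(y)\bigr).

% Strip [x_2,1] x [0,1], D_2 > 0: apply (6) with X_1 = x_2, X_2 = 1.
C(x,y) = \min\bigl(C(x,1) + C(x_2,y) - C(x_2,1),\; C(x,0) + C(1,y) - C(1,0)\bigr)
       = \min\bigl(x - x_2 + h_2(y),\; y\bigr).
```

In each line only $C(x,1) = x$, $C(1,y) = y$, $C(x,0) = C(0,y) = 0$ and the definitions of $h_1$, $h_2$ are used.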

As we will see below, this result implies that in an ideal situation the problem is reduced to the determination of suitable functions $h_1$ and $h_2$.

Before going on we need to determine $dC(x,y)$ for the special situation of (9). For this reason let us consider the rectangles
Figure 8. Rectangle with upper corners $(x,y+dy)$ and $(x+dx,y+dy)$.

Figure 9. Rectangle with upper corners $(x-dx,y+dy)$ and $(x,y+dy)$.

and the measures induced by the copula, which are defined by

$$\gamma_C(dx,dy) = C(x,y) + C(x+dx,y+dy) - C(x,y+dy) - C(x+dx,y) \qquad (10)$$

and

$$\gamma_C(dx,dy) = C(x-dx,y) + C(x,y+dy) - C(x-dx,y+dy) - C(x,y), \qquad (11)$$

where $(dx,dy)$ stands for the infinitesimal rectangles from Fig. 8 and Fig. 9.

We consider the three regions in Fig. 7 where the second derivative changes sign separately.

(i) $x\in(0,x_1)$.

Then $x = h_1(y)$ and $C(x,y) = \min(x,h_1(y))$. Thus by (10)

$$\gamma_C(dx,dy) = h_1(y) + (h_1(y) + h_1'(y)dy) - h_1(y) - h_1(y) = h_1'(y)dy. \qquad (12)$$


(ii) $x\in(x_1,x_2)$.

Then $x = x_2 - h_2(y) + h_1(y)$ and $C(x,y) = \max(x + h_2(y) - x_2, h_1(y))$. Let us observe that

$$C(x,y+dy) = \max(x + h_2(y+dy) - x_2,\ h_1(y+dy)) = \max(h_1(y) + h_2'(y)dy,\ h_1(y) + h_1'(y)dy) = h_1(y) + h_2'(y)dy,$$

since for every $(x,y)$ such that $x + h_2(y) - x_2 = h_1(y)$ we have $\frac{dx}{dy} + h_2'(y) = h_1'(y)$ and $\frac{dx}{dy} < 0$.

Similarly

$$C(x-dx,y) = \max(x - dx + h_2(y) - x_2,\ h_1(y)) = \max(h_1(y) - dx,\ h_1(y)) = h_1(y),$$

since $dx > 0$. Thus from (11) we have

$$\gamma_C(dx,dy) = h_1(y) + (h_1(y) + h_2'(y)dy) - (h_1(y) + h_1'(y)dy) - h_1(y) = (h_2'(y) - h_1'(y))dy. \qquad (13)$$


(iii) $x\in(x_2,1)$.

Then $x = x_2 - h_2(y) + y$ and $C(x,y) = \min(x - x_2 + h_2(y), y)$. Let us observe that

$$C(x+dx,y+dy) = \min(x + dx - x_2 + h_2(y+dy),\ y+dy) = \min(y + dx + h_2'(y)dy,\ y+dy) = y + dy,$$

since for every $(x,y)$ such that $x - x_2 + h_2(y) = y$ we have $dx + h_2'(y)dy = dy$.

Moreover

$$C(x,y+dy) = \min(x - x_2 + h_2(y+dy),\ y+dy) = \min(y + h_2'(y)dy,\ y+dy) = y + h_2'(y)dy,$$

since $h_2'(y)\le 1$, and

$$C(x+dx,y) = \min(x + dx - x_2 + h_2(y),\ y) = \min(y + dx,\ y) = y,$$

since $dx > 0$. Therefore with (10) we arrive at

$$\gamma_C(dx,dy) = C(x,y) + C(x+dx,y+dy) - C(x,y+dy) - C(x+dx,y) = y + y + dy - (y + h_2'(y)dy) - y = (1 - h_2'(y))dy. \qquad (14)$$


Altogether the measure $\gamma_C$ of the infinitesimal rectangles and hence $dC(x,y)$ is given by

$$\gamma_C(dx,dy) = \begin{cases} h_1'(y)dy & \text{if } x\in[0,x_1],\ x = h_1(y),\\ (h_2'(y) - h_1'(y))dy & \text{if } x\in[x_1,x_2],\ x = x_2 - h_2(y) + h_1(y),\\ (1 - h_2'(y))dy & \text{if } x\in[x_2,1],\ x = x_2 - h_2(y) + y. \end{cases} \qquad (15)$$


Our next step is to identify situations in which C is indeed a copula. 


Theorem 4. The function $C(x,y)$ defined by (9) is a copula if and only if

(i) $h_1(y)$ and $h_2(y)$ are increasing;

(ii) $h_1(0) = 0$, $h_2(0) = 0$;

(iii) $h_1(1) = x_1$, $h_2(1) = x_2$;

(iv) $0\le h_1(y)\le h_2(y)\le y$;

(v) $0\le h_1'(y)\le h_2'(y)\le 1$.


Proof. The structure of the proof is as follows: we first prove the necessary condition by showing that if $C$ is a copula, then properties (i) to (v) are satisfied. Then we exploit these properties to show that $C$ is a copula.

Let $C(x,y)$ be a copula and $h_1(y) = C(x_1,y)$ and $h_2(y) = C(x_2,y)$. Properties (i) to (iii) are straightforward. In order to prove (iv), let us consider the rectangle $[x,1]\times[0,y_1]$. Since $C(x,y)$ is a copula, we have

$$C(x,0) + C(1,y_1) - C(x,y_1) - C(1,0) = y_1 - C(x,y_1) \ge 0, \qquad (16)$$

and thus $y_1\ge C(x,y_1)$.

We proceed in a similar way to prove (v). Let us consider the rectangle $[x,1]\times[y_1,y_2]$. For an arbitrary copula $C(x,y)$ we have

$$C(x,y_1) + C(1,y_2) - C(x,y_2) - C(1,y_1) = C(x,y_1) + y_2 - C(x,y_2) - y_1 \ge 0. \qquad (17)$$

Then

$$y_2 - y_1 \ge C(x,y_2) - C(x,y_1)$$

and thus $C'(x,y)\le 1$ a.e., where the derivative is taken with respect to $y$. This implies $h'(y)\le 1$ a.e. for $h(y) = C(x,y)$ (see also m Theorem 2.2.7.]). Furthermore

$$h_1'(y)\le h_2'(y)$$

since for every $(x,y)$ such that $x + h_2(y) - x_2 = h_1(y)$ with $x\in[x_1,x_2]$ we have $\frac{dx}{dy} + h_2'(y) = h_1'(y)$ and $\frac{dx}{dy} < 0$.

On the other hand, it follows from (v) and (15) that $\gamma_C(dx,dy)$ is nonnegative for every $(x,y)\in[0,1]^2$ and by (9) that $C(x,0) = C(0,y) = 0$. Thus $C(x,y)$ is a distribution function. We need to show that $C(x,1) = x$ and $C(1,y) = y$ for every $(x,y)\in[0,1]^2$. Indeed we have

$$C(x,1) = \begin{cases} \min(x,h_1(1)) = \min(x,x_1) = x & \text{if } x\in[0,x_1],\\ \max(x + h_2(1) - x_2,\ h_1(1)) = \max(x,x_1) = x & \text{if } x\in[x_1,x_2],\\ \min(x - x_2 + h_2(1),\ 1) = \min(x,1) = x & \text{if } x\in[x_2,1]. \end{cases}$$

For $x = 1$ we need

$$C(1,y) = \min(1 - x_2 + h_2(y),\ y) = y. \qquad (18)$$

Since $h_2(1) = x_2$, (18) is equivalent to

$$1 - y \ge h_2(1) - h_2(y), \qquad (19)$$

which holds true, since $h_2(1) - h_2(y) = (1-y)h_2'(y^*)$ for some $y^*\in(y,1)$ and the derivative satisfies (v). □
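The conditions of Theorem 4 can be probed numerically. The sketch below is only an illustration: the breakpoints $x_1 = 0.3$, $x_2 = 0.6$ and the linear choice $h_i(y) = x_i y$ (which satisfies (i) to (v)) are our assumptions. It builds $C$ from the three-branch formula (9) and checks the boundary conditions and the rectangle inequality on a grid.

```python
def make_copula(x1, x2, h1, h2):
    """Return the function C(x, y) given by the three-branch formula (9)."""
    def C(x, y):
        if x <= x1:
            return min(x, h1(y))
        if x <= x2:
            return max(x + h2(y) - x2, h1(y))
        return min(x - x2 + h2(y), y)
    return C


x1, x2 = 0.3, 0.6                                   # hypothetical breakpoints
C = make_copula(x1, x2, lambda y: x1 * y, lambda y: x2 * y)

n = 50
grid = [i / n for i in range(n + 1)]

# Boundary conditions C(t,1) = t, C(1,t) = t, C(t,0) = C(0,t) = 0.
boundary_error = max(
    max(abs(C(t, 1.0) - t), abs(C(1.0, t) - t), abs(C(t, 0.0)), abs(C(0.0, t)))
    for t in grid
)

# Smallest C-volume over all grid cells; it should be nonnegative up to rounding.
min_volume = min(
    C(grid[i + 1], grid[j + 1]) - C(grid[i + 1], grid[j])
    - C(grid[i], grid[j + 1]) + C(grid[i], grid[j])
    for i in range(n) for j in range(n)
)
print(boundary_error, min_volume)
```

A grid check of this kind can expose a violation of the copula properties but cannot certify them; the proof above remains the actual argument.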

Theorem 4 implies the following bounds on candidate functions $h_1$ and $h_2$:

$$\underline{h}_1(y)\le h_1(y)\le\overline{h}_1(y), \qquad \underline{h}_2(y)\le h_2(y)\le\overline{h}_2(y), \qquad (20)$$

where, for $i = 1,2$,

$$\overline{h}_i(y) = \begin{cases} y & \text{if } y\in[0,x_i],\\ x_i & \text{if } y\in[x_i,1], \end{cases} \qquad \underline{h}_i(y) = \begin{cases} 0 & \text{if } y\in[0,1-x_i],\\ y - (1-x_i) & \text{if } y\in[1-x_i,1]. \end{cases}$$
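The bounds in (20) are the Fréchet-Hoeffding bounds evaluated at $x = x_i$. As a small sanity check, the sketch below tests them for the linear candidates $h_i(y) = x_i y$; the breakpoints and the candidate functions are hypothetical choices for illustration.

```python
def lower_bound(x_i, y):
    """Lower bound in (20): 0 on [0, 1 - x_i] and y - (1 - x_i) on [1 - x_i, 1]."""
    return max(x_i + y - 1.0, 0.0)


def upper_bound(x_i, y):
    """Upper bound in (20): y on [0, x_i] and x_i on [x_i, 1]."""
    return min(x_i, y)


x1, x2 = 0.3, 0.6                       # hypothetical breakpoints
ys = [j / 100 for j in range(101)]
eps = 1e-12                             # tolerance for floating-point rounding

bounds_hold = all(
    lower_bound(x, y) - eps <= x * y <= upper_bound(x, y) + eps
    for x in (x1, x2) for y in ys
)
print(bounds_hold)
```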


Now, we return to the integral (2).

Theorem 5. Let us define a function $G$ by

$$G := G(y,h_1,h_2,h_1',h_2') = F_1(h_1(y),y)\,h_1'(y) + F_2(x_2 - h_2(y) + h_1(y),y)\,(h_2'(y) - h_1'(y)) + F_3(x_2 - h_2(y) + y,y)\,(1 - h_2'(y)). \qquad (21)$$

If $h_1,h_2$ maximize $\int_0^1 G\,dy$ and satisfy Theorem 4, then

$$\max_{C(x,y)\text{ copula}}\int_0^1\!\!\int_0^1 F(x,y)\,dC(x,y) = \int_0^1 G\,dy. \qquad (22)$$

If not, then we only have the following inequality

$$\max_{C(x,y)\in\mathcal{C}}\int_0^1\!\!\int_0^1 F(x,y)\,dC(x,y) \le \int_0^1 G\,dy. \qquad (23)$$

The class $\mathcal{C}$ is the set of all copulas of the form (9) with $h_1,h_2$ fulfilling the assumptions of Theorem 4.
Proof. Let $F$ be a function defined on $[0,1]^2$ such that $D_2 = \frac{\partial^2 F}{\partial x\,\partial y}$ changes its sign as indicated in Fig. 4. Then the two-dimensional Riemann-Stieltjes integral of $F$ with respect to the copula $C$ defined in (9) is given as follows

$$\int_0^1\!\!\int_0^1 F(x,y)\,dC(x,y) = \int_0^1 F_1(h_1(y),y)\,h_1'(y)\,dy + \int_0^1 F_2(x_2 - h_2(y) + h_1(y),y)\,(h_2'(y) - h_1'(y))\,dy + \int_0^1 F_3(x_2 - h_2(y) + y,y)\,(1 - h_2'(y))\,dy \qquad (24)$$

$$= \int_0^1 G\,dy. \qquad (25)$$

Since under the assumptions of Theorem 4, $C$ is indeed a copula, the representation from Theorem 3 implies optimality. The second statement is obvious, since the class $\mathcal{C}$ is a subset, due to additional restrictions, of candidate functions $h_1,h_2$. □

Remark 1. Note that to compute extremes of $\int_0^1 G(y,h_1,h_2,h_1',h_2')\,dy$ we can apply calculus of variations (cf. [29, p. 33]). In particular, if $(h_1,h_2)$ are extrema for the integral $\int_0^1 G(y,h_1,h_2,h_1',h_2')\,dy$, then $(h_1,h_2)$ satisfy the Euler-Lagrange differential equations

$$\frac{\partial G}{\partial h_1} = \frac{d}{dy}\frac{\partial G}{\partial h_1'}, \qquad \frac{\partial G}{\partial h_2} = \frac{d}{dy}\frac{\partial G}{\partial h_2'}. \qquad (26)$$

The solution $(h_1,h_2)$ to (26) maximizes $\int_0^1 G(y,h_1,h_2,h_1',h_2')\,dy$ if


dh[dh[ - ’ 


d'^G 

dh'.dh'^ 

d^G 

dh^dh'^ 


d'^G 

dh\dh'2 

d^G 

'dh^dh^ 


< 0 . 


(27) 


Remark 2. From the optimal copula with representation (9) and the properties of $h_1$ and $h_2$ from Theorem 4 we can derive the solution of the problem in the vocabulary of optimal couplings as well. Notice that for $x\in[0,x_1)$, $(x,y)$ is mapped to $(h_1(y),y)$. According to Theorem 4, $h_1$ is monotone increasing and admits an inverse $g_1$. For $x\in[x_1,x_2)$, we have that $(x,y)$ is mapped to $(x_2 - (h_2(y) - h_1(y)),y)$, where $x_2 - (h_2(y) - h_1(y))$ is monotone decreasing in $y$ with inverse function $g_2$. Finally, for $x\in[x_2,1]$, we have $(x,y)\mapsto(x_2 + y - h_2(y),y)$, with $x_2 + y - h_2(y)$ increasing in $y$ and inverse $g_3$. Therefore we can identify the optimal coupling $(U,T(U))$ for $U$ uniformly distributed on $[0,1]$ and

$$T(x) = \begin{cases} g_1(x), & x\in[0,x_1),\\ g_2(x), & x\in[x_1,x_2),\\ g_3(x), & x\in[x_2,1]. \end{cases}$$


3 A piecewise linear cost function 


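Let

$$F(x,y) = \begin{cases} F_1(x,y) = \dfrac{x}{x_1}\,y, & x\in(0,x_1),\\[6pt] F_2(x,y) = \dfrac{x_2 - x}{x_2 - x_1}\,y, & x\in(x_1,x_2),\\[6pt] F_3(x,y) = \dfrac{x - x_2}{1 - x_2}\,y, & x\in(x_2,1), \end{cases}$$

with x-component as shown in Fig. 10.

Figure 10. The piecewise linear x-component, with branches $\frac{x}{x_1}$, $\frac{x_2 - x}{x_2 - x_1}$ and $\frac{x - x_2}{1 - x_2}$ over the breakpoints $0$, $x_1$, $x_2$, $1$.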

Towards the construction of $G$ from (21), we identify

$$F_1(h_1(y),y)\,h_1'(y) = \frac{h_1}{x_1}\,y\,h_1',$$

$$F_2(x_2 - h_2(y) + h_1(y),y)\,(h_2'(y) - h_1'(y)) = \frac{h_2 - h_1}{x_2 - x_1}\,y\,(h_2' - h_1'),$$

$$F_3(x_2 - h_2(y) + y,y)\,(1 - h_2'(y)) = \frac{y - h_2}{1 - x_2}\,y\,(1 - h_2'),$$

such that $G$ takes the form

$$G = \frac{h_1}{x_1}\,y\,h_1' + \frac{h_2 - h_1}{x_2 - x_1}\,y\,(h_2' - h_1') + \frac{y - h_2}{1 - x_2}\,y\,(1 - h_2'). \qquad (28)$$
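The associated Euler-Lagrange equations are given by

$$\frac{\partial G}{\partial h_1} - \frac{d}{dy}\frac{\partial G}{\partial h_1'} = \frac{h_1'}{x_1}\,y - \frac{h_2' - h_1'}{x_2 - x_1}\,y - \frac{h_1}{x_1} + \frac{h_2 - h_1}{x_2 - x_1} = 0, \qquad (29)$$

$$\frac{\partial G}{\partial h_2} - \frac{d}{dy}\frac{\partial G}{\partial h_2'} = \frac{h_2' - h_1'}{x_2 - x_1}\,y - \frac{1 - h_2'}{1 - x_2}\,y - \frac{h_2 - h_1}{x_2 - x_1} - \frac{h_2}{1 - x_2} + \frac{2y}{1 - x_2} = 0. \qquad (30)$$

Now, adding (29) and (30) and multiplying the sum by $\frac{1 - x_2}{y}$ we get

$$h_1'\,\frac{1 - x_2}{x_1} + h_2' = \frac{h_1}{y}\,\frac{1 - x_2}{x_1} + \frac{h_2}{y} - 1. \qquad (31)$$

Multiplication of (29) with $\frac{x_2 - x_1}{y}$ gives

$$h_1'\left(\frac{x_2 - x_1}{x_1} + 1\right) - h_2' = \frac{h_1}{y}\left(\frac{x_2 - x_1}{x_1} + 1\right) - \frac{h_2}{y}. \qquad (32)$$

Summing up (31) and (32) we find

$$\frac{h_1'}{x_1} = \frac{h_1}{x_1 y} - 1, \qquad (33)$$

and hence

$$h_1' = \frac{h_1}{y} - x_1, \qquad h_2' = \frac{h_2}{y} - x_2. \qquad (34)$$

The general solution $h(y)$ of the differential equation

$$h' = \frac{h}{y} - x \qquad (35)$$

has the form

$$h(y) = cy - xy\log y. \qquad (36)$$

From the boundary conditions $h(1) = x$ and $h(0) = 0$, we find $h(y) = xy(1 - \log y)$ and $h'(y) = x(-\log y)$. Thus

$$h_1(y) = x_1 y(1 - \log y), \qquad h_2(y) = x_2 y(1 - \log y), \qquad (37)$$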


which unfortunately do not satisfy condition (v) in Theorem 4. Finally, $G$ is given through

$$G = y^2\log y\,\frac{x_2}{1 - x_2} + y^2(\log y)^2\,\frac{x_2}{1 - x_2} + y^2,$$

which yields the value

$$\int_0^1 G\,dy = \frac{1}{3} - \frac{1}{27}\,\frac{x_2}{1 - x_2}. \qquad (38)$$

On the other hand, if $h_1(y) = x_1 y$ and $h_2(y) = x_2 y$, then $G = y^2$ and $\int_0^1 G\,dy = \frac{1}{3}$. Thus (37) does not maximize $\int_0^1 G\,dy$.
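The comparison of the two candidate pairs can be reproduced numerically. The sketch below evaluates the integrand $G$ for the piecewise linear cost of this section, once with the logarithmic functions $h_i(y) = x_i y(1-\log y)$ and once with the linear functions $h_i(y) = x_i y$, and integrates with a midpoint rule. The breakpoints, the quadrature and the closed form $\frac13 - \frac{x_2}{27(1-x_2)}$ used for comparison are our assumptions (the closed form comes from integrating $y^2\log y$ and $y^2(\log y)^2$ term by term).

```python
import math

x1, x2 = 0.3, 0.6   # hypothetical breakpoints with 0 < x1 < x2 < 1


def G(y, h1, h2, dh1, dh2):
    """Integrand G for F_1 = x*y/x1, F_2 = (x2 - x)*y/(x2 - x1), F_3 = (x - x2)*y/(1 - x2)."""
    return (h1 / x1 * y * dh1
            + (h2 - h1) / (x2 - x1) * y * (dh2 - dh1)
            + (y - h2) / (1.0 - x2) * y * (1.0 - dh2))


def G_log(y):
    """G evaluated at h_i(y) = x_i * y * (1 - log y), with h_i'(y) = -x_i * log y."""
    s, ds = y * (1.0 - math.log(y)), -math.log(y)
    return G(y, x1 * s, x2 * s, x1 * ds, x2 * ds)


def G_linear(y):
    """G evaluated at h_i(y) = x_i * y, with h_i'(y) = x_i."""
    return G(y, x1 * y, x2 * y, x1, x2)


def midpoint_integral(func, n=20000):
    """Midpoint rule on [0, 1]; it never evaluates func at y = 0, where log is undefined."""
    step = 1.0 / n
    return step * sum(func((k + 0.5) * step) for k in range(n))


value_log = midpoint_integral(G_log)
value_linear = midpoint_integral(G_linear)
closed_form_log = 1.0 / 3.0 - x2 / (27.0 * (1.0 - x2))

# Condition (v) of Theorem 4 requires h_1' <= 1; near y = 0 the logarithmic h_1 violates it.
slope_near_zero = -x1 * math.log(0.01)
print(value_log, closed_form_log, value_linear, slope_near_zero)
```

In this sketch the linear pair gives the larger value, in line with the observation that the logarithmic pair does not maximize the integral.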

Note that for $F(x,y) = f(x)y$, with $f(x)$ a uniform distribution preserving map (u.d.p.), we have

$$\max_{x_n,\,y_n \text{ are u.d.}}\ \lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N} f(x_n)\,y_n = \max_{C(x,y)\text{ copula}}\int_0^1\!\!\int_0^1 F(x,y)\,dC(x,y), \qquad (39)$$

$$\max_{x_n \text{ u.d.},\ \Phi \text{ u.d.p.}}\ \lim_{N\to\infty}\frac{1}{N}\sum_{n=1}^{N} f(x_n)\,\Phi(x_n) = \max_{\Phi \text{ u.d.p.}}\int_0^1 f(x)\,\Phi(x)\,dx. \qquad (40)$$

Then (40) is in general smaller than (39). By [3, Corollary 3] for u.d.p. $f(x)$ we have

$$\max_{\Phi \text{ u.d.p.}}\int_0^1 f(x)\,\Phi(x)\,dx = \frac{1}{3}. \qquad (41)$$

Therefore in our situation we have $\max_{C}\int_0^1\!\int_0^1 F(x,y)\,dC(x,y)\ge\frac{1}{3}$.

Remark 3. The example points out the deficiencies of the variational formulation in the present context. When maximizing (2) it is essential to preserve the uniform distribution property of the marginals. In the formulation via the function $G$ from (21), which led to the problem from the calculus of variations, this constraint is not present anymore and the maximization takes place over differentiable $h_1$, $h_2$. On the other hand the optimal copula $C$ in (9) with $h_1(y) = C(x_1,y)$ and $h_2(y) = C(x_2,y)$ does not enforce any smoothness properties, which implies that when solving (21) one does not necessarily get an upper bound for (2). The same reasoning suggests that when maximizing (21) over differentiable functions which fulfill the conditions stated in Theorem 4, one in general derives a lower bound for (2).


4 A different approach using coupling 

In this section we consider the example

$$F(x,y) = \sin(\pi(x+y))$$

and relate this problem to combinatorial optimization. In [9] an upper bound for

$$\int_{[0,1]^2}\sin(\pi(x+y))\,\gamma(dx,dy) \qquad (42)$$

was found by means of the Hungarian Algorithm and we will show that the copula found in [2] is indeed the one maximizing (42).

For dealing with this particular example one may utilize Theorem 1 from [25]. But since the proof of what we actually need is not given there, we re-state the following particular version of this result and give its proof.

Theorem 6. Let $\mu$, $\nu$ be the uniform distribution on $[0,1]$ and the cost function $c(x,y) = \phi(x+y)$ with $\phi\colon[0,2]\to\mathbb{R}$. In particular we assume that $\phi\in C^2[0,2]$ and that there is $k\in(0,2)$ such that $\phi''(x) < 0$ for $x\in[0,k)$ and $\phi''(x) > 0$ for $x\in(k,2]$. If $\beta\in(0,1)$ denotes the solution to

$$\phi(2\beta) - \phi(\beta) = \beta\,\phi'(\beta),$$

then

$$T(x) = \begin{cases} \beta - x, & x\in[0,\beta),\\ x, & x\in[\beta,1], \end{cases}$$

induces by $(U,T(U))$ for some standard uniformly distributed $U$ an optimal c-coupling between $\mu$ and $\nu$.


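Proof. For the proof we proceed as proposed in [25] and [20]. Define the following functions:

$$f_1(x) = x\,\phi'(\beta),$$

$$f_2(x) = \tfrac{1}{2}\big(\phi(2x) - \phi(2\beta)\big) + \beta\,\phi'(\beta),$$

$$\psi^1_x(\xi) = \phi(\beta - x + \xi) + x\,\phi'(\beta) - \phi(\beta),$$

$$\psi^2_x(\xi) = \phi(x + \xi) - \tfrac{1}{2}\phi(2x) - \tfrac{1}{2}\phi(2\beta) + \beta\,\phi'(\beta).$$

Furthermore set

$$f(x) = f_1(x)\,I_{[0,\beta)}(x) + f_2(x)\,I_{[\beta,1]}(x),$$

and put for $\xi\in[0,1]$:

$$\psi_{T(x)}(\xi) = \begin{cases} \psi^1_x(\xi), & x\in[0,\beta),\\ \psi^2_x(\xi), & x\in[\beta,1]. \end{cases}$$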
Here $\psi_{T(x)}(\xi)$ plays the role of $\psi_{y,a}(\xi) = c(\xi,y) + a(y)$ with $y = T(x)$ in (4). Now the idea, following Theorem 1 and (3), is to show that $y = T(x)$ is in the c-subdifferential of $f(x)$ for all $x\in[0,1]$, which implies optimality of this particular coupling and optimality of the distribution induced by $(U,T(U))$ for the transport problem. For the c-convexity of $f$ and the subdifferential property we need to show:

$$\psi_{T(x)}(x) = f(x) \quad \forall x\in[0,1],$$

$$\psi_{T(x)}(\xi) \le f(\xi) \quad \forall \xi\in[0,1].$$

We start with showing that $\psi_{T(x)}(x) = f(x)$. For $x\in[0,\beta)$ we have that $T(x) = \beta - x$ and

$$\psi_{T(x)}(x) = \psi^1_x(x) = x\,\phi'(\beta) = f_1(x) = f(x).$$

For $x\in[\beta,1]$ we have $T(x) = x$ and

$$\psi_{T(x)}(x) = \psi^2_x(x) = \tfrac{1}{2}\big(\phi(2x) - \phi(2\beta)\big) + \beta\,\phi'(\beta) = f_2(x) = f(x).$$

It remains to show $\psi_{T(x)}(\xi)\le f(\xi)$ for all $(x,\xi)\in[0,1]\times[0,1]$.

At first we need some details on the location of $\beta$. From the definition of $\beta$ and the mean value theorem we obtain $\exists\,\tau\in(\beta,2\beta)$ with $\phi'(\tau) = \phi'(\beta)$. Since $\phi$ is concave on $[0,k)$ and convex on $(k,2]$ we see that $\beta < k < \tau < 2\beta$. In the following we distinguish four situations.

• For $x\in[0,\beta)$ and $\xi\in[0,\beta)$ the functions are given by

$$\psi_{T(x)}(\xi) = \phi(\beta - x + \xi) + x\,\phi'(\beta) - \phi(\beta), \qquad f(\xi) = \xi\,\phi'(\beta).$$

Therefore we need to check:

$$f(\xi) - \psi_{T(x)}(\xi) = (\xi - x)\,\phi'(\beta) - \big(\phi(\beta + \xi - x) - \phi(\beta)\big) =: F_1(x,\xi). \qquad (43)$$

Obviously $F_1(x,\xi) = 0$ for $(0,\beta)$, $(0,0)$ and $(x,x)$. In general we have by concavity on $[0,k)$ and the definition of $\beta$, $\phi(2\beta) = \phi(\beta) + \beta\,\phi'(\beta)$,

$$\phi(\beta) + (\beta + \xi - x - \beta)\,\phi'(\beta) \ge \phi(\beta + \xi - x),$$

since $\beta + \xi - x\in[0,2\beta]$, which proves that (43) is nonnegative.
• For $x\in[0,\beta)$ and $\xi\in[\beta,1]$ the functions are given by

$$\psi_{T(x)}(\xi) = \phi(\beta - x + \xi) + x\,\phi'(\beta) - \phi(\beta), \qquad f(\xi) = f_2(\xi) = \tfrac{1}{2}\big(\phi(2\xi) - \phi(2\beta)\big) + \beta\,\phi'(\beta).$$

We need the following to be nonnegative:

$$\tfrac{1}{2}\big(\phi(2\xi) - \phi(2\beta)\big) + \beta\,\phi'(\beta) - \phi(\beta + \xi - x) - x\,\phi'(\beta) + \phi(\beta)$$

$$= \tfrac{1}{2}\phi(2\xi) + \tfrac{1}{2}\phi(2\beta) - x\,\phi'(\beta) - \phi(\beta + \xi - x)$$

$$\ge \phi(\xi + \beta) - x\,\phi'(\beta) - \phi(\beta + \xi - x) =: F_2(x,\xi),$$

where the equality above follows from the definition of $\beta$ and the inequality follows from convexity since $k < 2\beta\le 2\xi$. Observe $F_2(x,\xi) = 0$ for $(0,\xi)$ and $(\beta,\beta)$. To prove nonnegativity the strategy is as follows: firstly show $F_2(x,\xi = \beta)\ge 0$ for all $x\in[0,\beta)$ and secondly show that $\frac{\partial}{\partial\xi}F_2(x,\xi)\ge 0$ for all $(x,\xi)\in[0,\beta)\times(\beta,1)$.

Look at

$$\frac{\partial}{\partial x}F_2(x,\xi = \beta) = -\phi'(\beta) + \phi'(2\beta - x),$$

which is zero in $(0,\beta)$ exactly if $x = \bar{x} = 2\beta - \tau < \beta$. Since $0 < \phi''(\tau) = -\frac{\partial^2}{\partial x^2}F_2(x,\xi = \beta)\big|_{x=\bar{x}}$ we have that

$$F_2(\bar{x},\xi = \beta) = \phi(2\beta) - (2\beta - \tau)\,\phi'(\tau) - \phi(\tau) > 0$$

is a maximum; it is positive by convexity ($k < \tau < 2\beta$). Thus, $F_2(x,\xi = \beta)\ge 0$ for $x\in[0,\beta)$. Now we can deal with checking the interior,

$$\frac{\partial}{\partial\xi}F_2(x,\xi) = \phi'(\beta + \xi) - \phi'(\beta + \xi - x).$$

Suppose $\beta + \xi - x\ge k$, then by convexity $\phi'(\beta + \xi)\ge\phi'(\beta + \xi - x)$. On the other hand if $\beta + \xi - x < k$ we have $\phi'(\beta + \xi - x)\le\phi'(\beta) = \phi'(\tau)\le\phi'(2\beta)\le\phi'(\beta + \xi)$, since $2\beta\le\beta + \xi$.

• Consider $x\in[\beta,1]$ and $\xi\in[0,\beta)$, here

$$\psi_{T(x)}(\xi) = \phi(x + \xi) - \tfrac{1}{2}\phi(2x) - \tfrac{1}{2}\phi(2\beta) + \beta\,\phi'(\beta), \qquad f(\xi) = f_1(\xi) = \xi\,\phi'(\beta),$$

and we need

$$\xi\,\phi'(\beta) - \phi(x + \xi) + \tfrac{1}{2}\phi(2x) + \tfrac{1}{2}\phi(2\beta) - \beta\,\phi'(\beta) \ge \phi(x + \beta) - \phi(x + \xi) - (\beta - \xi)\,\phi'(\beta) =: F_3(x,\xi)$$

to be nonnegative; the inequality stems from convexity since $2x\ge 2\beta > k$. We proceed as before. $F_3(x,\xi) = 0$ for $(x,\beta)$ and $(\beta,0)$. At first fix $x = \beta$: $F_3(x = \beta,\xi) = 0$ if $\xi = 0$ and $\xi = \beta$. In between we study

$$\frac{\partial}{\partial\xi}F_3(x = \beta,\xi) = -\phi'(\beta + \xi) + \phi'(\beta),$$

which is zero in $(0,\beta)$ exactly if $\xi = \bar{\xi} = \tau - \beta$. Again due to convexity of $\phi$ on $(k,2]$ we have a maximum in $\bar{\xi}$ and

$$F_3(x = \beta,\bar{\xi}) = \phi(2\beta) - \phi(\tau) - (2\beta - \tau)\,\phi'(\tau) > 0,$$

such that we have $F_3(x = \beta,\xi)\ge 0$ for $\xi\in[0,\beta)$. On the interior we show that

$$\frac{\partial}{\partial x}F_3(x,\xi) = \phi'(\beta + x) - \phi'(x + \xi) \ge 0.$$

If $x + \xi\ge k$ we have from convexity $\phi'(\beta + x)\ge\phi'(x + \xi)$. If $x + \xi < k$ we have $\beta\le x + \xi < k < \tau < 2\beta\le\beta + x$ and therefore

$$\phi'(x + \xi)\le\phi'(\beta) = \phi'(\tau)\le\phi'(2\beta)\le\phi'(\beta + x).$$

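• Let $x\in[\beta,1]$ and $\xi\in[\beta,1]$. Here

$$\psi_{T(x)}(\xi) = \phi(x + \xi) - \tfrac{1}{2}\phi(2x) - \tfrac{1}{2}\phi(2\beta) + \beta\,\phi'(\beta), \qquad f(\xi) = f_2(\xi),$$

such that

$$f(\xi) - \psi_{T(x)}(\xi) = \tfrac{1}{2}\phi(2x) + \tfrac{1}{2}\phi(2\xi) - \phi(x + \xi) \ge 0$$

is fulfilled by convexity since $x + \xi,\ 2x,\ 2\xi\ge 2\beta > k$.

We can conclude that $\psi_{T(x)}(\xi) - f(\xi)\le 0$ for all $(x,\xi)\in[0,1]\times[0,1]$, which according to Theorem 1 shows that the vector $(U,T(U))$ for $U\sim\mathcal{U}([0,1])$ yields an optimal coupling. □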
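The two subdifferential conditions from the proof can be tested on a grid for the concrete choice $\phi(z) = \sin(\pi z)$ treated below. This is a numerical sketch only: the bisection bracket $[0.6, 0.9]$, the grid size and all function names are our assumptions, and a grid test cannot replace the case analysis above.

```python
import math


def phi(z):
    return math.sin(math.pi * z)


def dphi(z):
    return math.pi * math.cos(math.pi * z)


def solve_beta(lo=0.6, hi=0.9, iterations=200):
    """Bisection for phi(2b) - phi(b) = b * phi'(b) on a bracket with a sign change."""
    g = lambda b: phi(2.0 * b) - phi(b) - b * dphi(b)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if g(lo) * g(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


beta = solve_beta()


def f(x):
    """The candidate c-convex function: f_1 on [0, beta), f_2 on [beta, 1]."""
    if x < beta:
        return x * dphi(beta)
    return 0.5 * (phi(2.0 * x) - phi(2.0 * beta)) + beta * dphi(beta)


def psi(x, xi):
    """psi_{T(x)}(xi): psi^1_x for x < beta, psi^2_x for x >= beta."""
    if x < beta:
        return phi(beta - x + xi) + x * dphi(beta) - phi(beta)
    return phi(x + xi) - 0.5 * phi(2.0 * x) - 0.5 * phi(2.0 * beta) + beta * dphi(beta)


n = 200
grid = [i / n for i in range(n + 1)]
worst_violation = max(psi(x, xi) - f(xi) for x in grid for xi in grid)
touch_error = max(abs(psi(x, x) - f(x)) for x in grid)
print(beta, worst_violation, touch_error)
```

A value of `worst_violation` at the level of rounding error is consistent with $\psi_{T(x)}(\xi)\le f(\xi)$, and a vanishing `touch_error` with $\psi_{T(x)}(x) = f(x)$.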
Remark 4. If $\beta\ge 1$ then it can be shown as in the first step of the above proof that $(U,1-U)$ yields the optimal coupling. Loosely speaking one could say that the concave behaviour dominates the convex one.

Now we are prepared to answer the sine question. Setting $\phi(z) = \sin(\pi z)$ and $k = 1$ we immediately get:


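Corollary 1. For $c(x,y) = \sin(\pi(x+y))$ we have that the distribution of the vector $(U,T(U))$ for $U\sim\mathcal{U}([0,1])$ with

$$T(x) = \begin{cases} \beta - x, & x\in[0,\beta),\\ x, & x\in[\beta,1], \end{cases}$$

and $\beta = 0.7541996008265638\approx 0.7542$ (cf. fDf) which solves

$$\sin(2\pi\beta) - \sin(\pi\beta) = \beta\pi\cos(\pi\beta), \qquad (44)$$

is maximizing

$$\int_{[0,1]^2}\sin(\pi(x+y))\,dC(x,y)$$

in the set of all bivariate distributions $C$ with uniform marginals, i.e., in the set of all copulas.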
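Equation (44) has no closed-form solution, but its root is easy to approximate. The following minimal sketch uses plain bisection; the bracket $[0.6, 0.9]$ is our assumption, chosen because the left-hand side minus the right-hand side changes sign there (note that $\beta = 0$ is a trivial root outside the bracket).

```python
import math


def residual(beta):
    """Left-hand side minus right-hand side of (44)."""
    return (math.sin(2.0 * math.pi * beta) - math.sin(math.pi * beta)
            - beta * math.pi * math.cos(math.pi * beta))


def bisect(func, lo, hi, iterations=200):
    """Plain bisection; assumes func(lo) and func(hi) have opposite signs."""
    f_lo = func(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if f_lo * f_mid <= 0.0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


beta = bisect(residual, 0.6, 0.9)
print(f"beta = {beta:.10f}, residual = {residual(beta):.2e}")
```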

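Remark 5. In this situation equation (44) meets the first order condition when looking at couplings of the form $(U,T^{\alpha}(U))$ with

$$T^{\alpha}(x) = \begin{cases} \alpha - x, & x\in[0,\alpha),\\ x, & x\in[\alpha,1], \end{cases}$$

or explicitly maximizing, for $c(x,y) = \sin(\pi(x+y))$,

$$\int_0^1 c(x,T^{\alpha}(x))\,dx.$$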


This is conjectured in [9] but with inaccurate a 


3 
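The one-parameter family of Remark 5 can also be explored directly. On $[0,\alpha)$ the integrand $\sin(\pi(x + \alpha - x))$ is the constant $\sin(\pi\alpha)$, and on $[\alpha,1]$ it is $\sin(2\pi x)$, so the value of the coupling is $J(\alpha) = \alpha\sin(\pi\alpha) + \frac{\cos(2\pi\alpha) - 1}{2\pi}$. This closed form is our own computation, stated here as an assumption of the sketch. The code maximizes $J$ over a grid and evaluates the first order condition at the maximizer.

```python
import math


def coupling_value(alpha):
    """J(alpha): integral of sin(pi * (x + T_alpha(x))) over [0, 1]."""
    return (alpha * math.sin(math.pi * alpha)
            + (math.cos(2.0 * math.pi * alpha) - 1.0) / (2.0 * math.pi))


def first_order_residual(alpha):
    """Equals -J'(alpha); it vanishes exactly when alpha solves (44)."""
    return (math.sin(2.0 * math.pi * alpha) - math.sin(math.pi * alpha)
            - alpha * math.pi * math.cos(math.pi * alpha))


steps = 10_000                                   # hypothetical grid resolution
alphas = [k / steps for k in range(steps + 1)]
alpha_best = max(alphas, key=coupling_value)
print(alpha_best, coupling_value(alpha_best), first_order_residual(alpha_best))
```

The grid maximizer lies close to the root of (44), which matches the statement that (44) is the first order condition within this family.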